Back-Propagation as Reinforcement in Prediction Tasks
نویسنده
چکیده
The back-propagation (BP) training scheme is widely used for training network models in cognitive science besides its well known technical and biological short-comings. In this paper we contribute to making the BP training scheme more acceptable from a biological point of view in cognitively motivated prediction tasks overcoming one of its major drawbacks. Traditionally, recurrent neural networks in symbolic time series prediction (e. g. language) are trained with gradient decent based learning algorithms, notably with back-propagation (BP) through time. A major drawback for the biological plausibility of BP is that it is a supervised scheme in which a teacher has to provide a fully specified target answer. Yet, agents in natural environments often receive a summary feed-back about the degree of success or failure only, a view adopted in reinforcement learning schemes. In this work we show that for simple recurrent networks in prediction tasks for which there is a probability interpretation of the network’s output vector, Elman BP can be reimplemented as a reinforcement learning scheme for which the expected weight updates agree with the ones from traditional Elman BP, using ideas from the AGREL learning scheme (van Ooyen and Roelfsema 2003) for feed-forward networks. Reinforcement learning where the teacher gives only feed-back about success or failure of an answer is thought to be biologically more plausible than supervised learning since a fully specified correct answer might not always be available to the learner or even the teacher ([1], especially for biological plausibility [2]). In this article we extent the ideas of the AGREL scheme [3] about how to implement back-propagation (BP) in feed-forward (FF) networks for classification tasks to encompass Elman BP for simple recurrent networks (SRNs) in prediction tasks [4]. The results have relevance especially for the cognitive science community for which SRNmodels have become an important tool [5], since they improve the standing of SRNs with respect to biological and cognitive plausibility.
منابع مشابه
Prediction of methanol loss by hydrocarbon gas phase in hydrate inhibition unit by back propagation neural networks
Gas hydrate often occurs in natural gas pipelines and process equipment at high pressure and low temperature. Methanol as a hydrate inhibitor injects to the potential hydrate systems and then recovers from the gas phase and re-injects to the system. Since methanol loss imposes an extra cost on the gas processing plants, designing a process for its reduction is necessary. In this study, an accur...
متن کاملOn the use of back propagation and radial basis function neural networks in surface roughness prediction
Various artificial neural networks types are examined and compared for the prediction of surface roughness in manufacturing technology. The aim of the study is to evaluate different kinds of neural networks and observe their performance and applicability on the same problem. More specifically, feed-forward artificial neural networks are trained with three different back propagation algorithms, ...
متن کاملEvaluation of Ultimate Torsional Strength of Reinforcement Concrete Beams Using Finite Element Analysis and Artificial Neural Network
Due to lack of theory of elasticity, estimation of ultimate torsional strength of reinforcement concrete beams is a difficult task. Therefore, the finite element methods could be applied for determination of strength of concrete beams. Furthermore, for complicated, highly nonlinear and ambiguous status, artificial neural networks are appropriate tools for prediction of behavior of such states. ...
متن کاملArtificial Neural Network Based Prediction Hardness of Al2024-Multiwall Carbon Nanotube Composite Prepared by Mechanical Alloying
In this study, artificial neural network was used to predict the microhardness of Al2024-multiwall carbon nanotube(MWCNT) composite prepared by mechanical alloying. Accordingly, the operational condition, i.e., the amount of reinforcement, ball to powder weight ratio, compaction pressure, milling time, time and temperature of sintering as well as vial speed were selected as independent input an...
متن کاملANN Based Modeling for Prediction of Evaporation in Reservoirs (RESEARCH NOTE)
This paper is an attempt to assess the potential and usefulness of ANN based modeling for evaporation prediction from a reservoir, where in classical and empirical equations failed to predict the evaporation accurately. The meteorological data set of daily pan evaporation, temperature, solar radiation, relative humidity, wind speed is used in this study. The performance of feed forward back pro...
متن کامل